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Abstract 

We present an approximate distance oracle for a point set S with n points and doubling dimension A. 
For every e > 0, the oracle supports (1 + e)-approximate distance queries in (universal) constant time, 
occupies space [£-° (A) + 2° (Alog A) ]n, and can be constructed in [2° (A) log 3 n + e^° (x) + 2° (AlogA) ]n 
expected time. This improves upon the best previously known constructions, presented by Har-Peled 
and Mendel |13j . Furthermore, the oracle can be made fully dynamic with expected O(l) query time 
and only 2° (A) logn + e~° w + 2° (AlogA ) update time. This is the first fully dynamic (1 + e)-distance 
oracle. 
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1 Introduction 



A distance oracle for a set of n points S under some distance function d(-,-), is a preprocessed data 
structure that given two points x,y € S returns their distance without needing to query the distance 
function. Distance oracles are of interest when the distance function is too large to store (for example 
when the function is a distance matrix storing all 0(n 2 ) interpoint distances) or when querying the distance 
function is expensive (for example when the distance function is defined by graph-induced distances). 

Distance oracles were introduced in a seminal paper of Thorup and Zwick [27]. For a weighted undirected 
graph, they gave an (2k — l)-approximate distance oracle with query time 0(k), for k > 1. Immediate from 
these runtimes is the question of reduced query time, and in fact Mendel and Naor [18] recently presented 
a 0(A:)-approximate distance oracle for general metrics with O(l) query time. Another direction for 
improvement is in the approximation guarantee, and distance oracles with (l+e)-approximation (0 < e < |) 
have been achieved for planar graphs [26] , geometric graphs [11] and doubling spaces [13] . 

A further question in this field is that of dynamic distance oracles. In this setting, the point set S is 
updated with the removal or addition of points, and the distance oracle must be updated accordingly. A 
similar paradigm was considered by [23], who gave a distance oracle for an unweighted undirected graph 
under the removal of edges. Here, the distance function is the shortest path metric of the underlying graph, 
which must be consulted during an oracle update. 

In this paper we consider a metric space with doubling dimension A, and present a (1 + e)-approximate 
distance that answers queries in (universal) constant time. The distance oracle occupies near-optimal 
space [e~° (A) + 2° (AlogA) ]ra, and can be constructed in [2° (A) log 3 n + e~° w + 2° (AlogA) ]n expected time. 
This improves upon the best previously known constructions in this setting, presented by Har-Peled and 
Mendel [13]. Furthermore, this oracle can be made fully dynamic with expected O(l) query time. In this 
case, the update time is only 2°( A ) logn + e~°^ + 2°( AlogA ) time per point. 



Related work. Thorup and Zwick [27] demonstrated that a weighted undirected graph can be prepro- 
cessed to create an oracle that can answer (2k — l)-approximate distance queries between any two vertices 
in 0(k) time. The structure is of size 0(n l+l l k ), and the randomized preprocessing takes Ofan 1 ^) time 
(where n is the number of vertices and m is the number of edges). Ro ditty, Thorup and Zwick [23] gave a de- 
terministic preprocessing algorithm that builds the distance oracle in 0(jnn}l k ) time. Baswana and Sen [S] 
and Baswana and Kavitha [3J improved the deterministic preprocessing time to 0(min(m- v /n, kn 2+1 l k )). 
Mendel and Naor |18] showed that for any metric space there exists an 0(/c)-approximate distance oracle 
of size 0(n l+1 / k ) that supports queries in constant time independent of k. 

Turning to lower bounds, Thorup and Zwick [27] proved that any (2/E + l)-approximate distance oracle must 
have size at least min(m, £l(n 1+l l k )). Very recently, Sommer, Verbin, and Yu [25] extended a technique of 
Patra§cu |21] to prove that a ^-approximate distance oracle preprocessed in t time must occupy r^+^i 1 /^) 
space. 

While the previous results apply to arbitrary metric space, distance oracles have also been studied for 
more restricted settings. Klein [15] and Thorup [26] considered planar graphs, and showed how to build a 
(1 + e)-distance oracle with 0((n log n)/e) space and 0(e~ l ) query time. (Thorup [26] presented an oracle 
for directed planar graphs.) Gudmundsson, Levcopoulos, Narasimhan and Smid [11] considered geometric 
(Euclidean) graphs that are t-spanners for some constant t > 1. (A graph G = (S,E) is said to be a 
i-spanner for S, if for any point pair p,q £ S, there exists in G a path connecting p and q, and the length 
of this path is at most t times the true distance between p and q.) They showed how to construct a (1 + e)- 
approximate distance oracle of size 0((^) d n\ogn). Their oracle can be constructed in ( £ ^ t t _ 1 ^ )° ( - d ^n\ogn 

time, and answers distance queries in 0((^) d ) time. Har-Peled and Mendel [13] considered metric spaces 
with low doubling dimension. They presented two data structures both of size e~°^n (which attains 
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Table 1: A summary of (1 + e) distance oracles. *In expectation. 



the lower-bound on the space required for this task). Their first data structure can be constructed in 
2°^nlog 2 n time and answers (1 + e)-approximate distance queries in 2°^ time. Their second data 
structure can be constructed in polynomial time and answers (1 + ^-approximate distance queries in 0(A) 
time. 

Our contribution. Our result improves the query time from 0(A) in the construction of Har-Peled 
and Mendel [13] to constant time, while also providing the first fully dynamic oracle construction. As in 
Har-Peled and Mendel [13], an immediate application of our static oracle for S is a (1 + e)-approximate 
distance oracle for every graph which is a t-spanner of 5. Our static oracle is a dramatic improvement over 
those of Gudmundsson et al. [11] reducing the query time to constant in several aspects, while the space 
is smaller by a factor of log n and the setting is more general. 

To obtain our improved bounds, we present contributions in several distinct areas, including dynamic 
embeddings and dynamic tree structures. We present two probabilistic dynamic embeddings for doubling 
spaces: The first is into a tree metric, and the second is a snowflake embedding into Euclidean space (see 
Section [673]). In both cases, we are interested in the probability of low distortion, as opposed to expectation. 
This seems to be the first consideration of dynamic embeddings (although the related notion of on-line 
embeddings recently appeared in [E]). We also present a powerful dynamic tree structure that allows a 
binary search over centroid paths in our setting (see Section [5.2p . Our oracle framework and the tools used 
in this paper further imply other distance oracles with various tradeoffs. A brief summary of these results 
is found in Table [TJ 

Paper outline. The rest of this paper is organized as follows. We first present preliminary points in 
the next section. We then describe (in Section [3j the basic structure that forms the backbone of most 
of our constructions. We proceed to present the central contribution of this paper, the O(l) query time 
oracles (both static and dynamic) in Section 0] The dynamic oracle requires two dynamic backup oracles, 
which are separate constructions of independent interest (presented in Section [5]), and both oracles require 
several technical contributions (presented in Section [6]). 

2 Preliminaries 

Here we review some preliminary definitions and results that are required in order to present our new ideas. 
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Lowest common ancestor query. A lowest common ancestor (LCA) query on tree T provides two 
nodes u, v of T, and asks for the node w that is ancestral to u and v, and is minimal in the sense that 
no descendant of w is ancestral to both u and v. LCA queries can be answered in O(l) time in the word 
RAM model, using a linear size data structure [7] . 

In the dynamic setting, Cole and Hariharan [7j gave a linear size data structure that supports LCA queries 
under insertions and deletions of leaves and internal nodes to the tree. The query and update times are 
all O(l) under the word RAM model. We can extend their structure to also identify in 0(1) time the two 
children of w that are ancestors of x and y (Lemma 1 12ft. 

Level ancestor query. A level ancestor query on tree T provides a node u and level k, and asks for the 
node w that is both an ancestor of u and is k nodes removed from the root of T. There exists a linear size 
structure that supports level ancestor queries in 0(1) time. 

In the dynamic setting, there exists a structure that supports level ancestor queries in O(l) search and 
update time under insertions of leaves into T. However, the insertion of internal nodes is not supported 
by this structure [21 HE]. For the purposes of this paper, we must maintain a tree under the insertions of 
internals nodes, hence we are unable to utilize standard level ancestor query structures in our dynamic 
setting. 

Doubling dimension. For a metric (X,d), let A be the smallest value such that every ball in X can be 
covered by 2 A balls of half the radius. The doubling dimension of X is dim(X) = A. A metric is doubling 
when its doubling dimension is constant. Note that while low Euclidean dimension implies low doubling 
dimension (Euclidean metrics of dimension d have doubling dimension 0{d) |12j). low doubling dimension 
is more general than low Euclidean dimension. The following property can be demonstrated via a repetitive 
application of the doubling property. 

Property 1 (Packing property) For set S with doubling dimension \, if the minimum interpoint dis- 
tance in S is at least a, and the diameter of S is at most b, then \S\ < 2°( Alog ( fe//a )) . 

Hierarchical Partitions. Similar to what was described in [3[T7], a subset of points X C Y is an (r, s)- 

discrete center set (or net in the terminology of [T7]) of Y (r < s) if it satisfies the following properties: 

(i) Packing: For every x, y £ X, d(x, y) > r. 

(ii) Covering: Every point y £ Y is strictly within distance s of some point x £ X: d(x, y) < s. 

The previous conditions require that the points of X be spaced out, yet nevertheless cover all points of Y. 

A hierarchical partition for a set S is a hierarchy of discrete center sets, where each level of the hierarchy 
is a discrete center set of the level beneath it. Krauthgamer and Lee [T7] gave a fully dynamic hierarchy 
that can be updated in time 2°^ log a (a is the aspect ratio of S), where a single update to S can result 
in 2°^ logo; updates to the hierarchy. Cole and Gottlieb [6] presented a semi-dynamic hierarchy, where a 
single insertion into S can result in the insertion of 2°^ points into the hierarchy. However, points cannot 
be removed from within the hierarchy, and after many deletions the hierarchy is rebuilt in the background 

x It suffices, if the hierarchy holds n' nodes (included those nodes storing deleted points), to start rebuilding after ^ 
deletions, and to complete the rebuilding over the next ^- insertions and deletions; that is, for each update to the point set 7 
updates are performed on the background structure. The completed hierarchy will then contain at least ^- points, including 
at most ^l- deleted points. 
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Our constructions can make use of either hierarchy, but our descriptions will assume the hierarchy of [6] . 
The bottom level of this hierarchy is the set Y\ = Y 5 o = S that contains all points, and the top level 
^riogsQl contains only a single point. (For ease of presentation, we assume throughout that the minimum 
interpoint distance in S is 1.) Each intermediate level < i < |~log 5 a] is represented by a set Y 5 i, which 
is a (^5*, |5*)-discrete center set for Y 5 i-i. The radius of level Y^m is defined to be 5 m . A point y G Y^m 
6-covers a point x G Y 5 i if d(y, x) < b-5 l , and the covering property states that each points in the hierarchy 
is |-covered by some point one level up. It can be shown by a repeated application of the covering property 
that each point is |-covered by some point in every higher level. 

The hierarchy may be augmented with neighbor links: Each point x G Y^m records what points of Y^m are 
within distance b ■ 5 m of x - these are the 6-neighbors of x. By the packing property, a point may have 
b°W such neighbors. To save space in the hierarchy, points that have no fe-neighbors (where b > 2) and 
also cover only one point in the next level may be represented implicitly. This compression scheme ensures 
that a hierarchy and neighbor links can be stored with 6°Wn space. 

Snowflake embedding. Assouad's [3] snowflake embedding - as improved by Gupta et al. [12] - takes 
an arbitrary metric (X, d) of doubling dimension A (where d is the metric's distance function), and embeds 
the snowflake (X, d 1 / 2 ) into 0(Alog A)-dimensional Euclidean space with 0(A) distortion. That is, the 
embedding into Euclidean space achieves low dimension and distortion, but has the 'side effect' that every 
interpoint distance in the metric is replaced by its square root, with distortion to the square root at most 
O(A). Har-Peled and Mendel [13] used this embedding in the context of distance oracles. 

Although [12] did not give an exact run time for their static embedding, the following analysis holds: The 
static embedding can be achieved by first building a point hierarchy that records 0(A)-neighbors; this can 
be done in [2°( A ) min{logn,loga} + 2°( AlogA )]n time [HI USE]. The analysis in [12] requires a constructive 
application of the Lovasz Local Lemma [2D], which in this case can be done in 2°( AlogA )n expected time. 
Given these constructions, the image for each point can easily be computed in 2°( Alog x ^n log a time, but 
a more careful analysis shows that 2°( AlogA ) work per hierarchical point is sufficient. It follows that the 
entire construction can be done in [2°( A ) minjlog n, log a} + 2°( AlogA )]n expected time. The construction 
of [12] is static, and so for our purposes we will need to create a dynamic version of the embedding (see 
Section [OjZ]). 

3 Construction backbone 

The backbone of our distance oracles is a point hierarchy, and we shall employ the semi-dynamic hierarchy 
of Cole and Gottlieb [6] augmented with storage of c-neighbor pairs, and the distance between the neighbors 
in a pair. We will show below that c = ^ (0 < e < ^) is an appropriate choice. On top of this hierarchy, 
we define a parent-child relationship as follows: every point x £ Y 5 i is a child of some point y G Y 5 i+i that 
|-covers x. This implies that points that are siblings must be 6-neighbors. The parent-child relationship 
immediately defines an ancestor-descendant relationship as well. (Note that some other constructions 
in this paper, such as the dynamic embeddings of Section 16.31 will require a different definition of the 
parent-child relationship.) We have the following property: 

Property 2 (Hereditary property) If two points x,y £ Y 5 i are c-neighbors then their respective parents 
x', y' G Y 5 i+i are c-neighbors as well. 

That is, if two points x,y G Y 5 i have the property that d(x,y) < c5*, then their respective parents 
x',y' G Y 5 i+i have the property that d(x',y') < d(x',x) + d(x,y) + d(y,y') < |5 m + c5 i + §5 m = 
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§5* +1 + c5* = (^^)5* +1 < c5* +1 . This means that x,y and all their ancestors up to their lowest common 
ancestor are all found explicitly in the hierarchy. 

The hierarchical tree T is extracted from the hierarchy. T has one node per hierarchical point, and the 
points of hierarchical level Y 5 i all have corresponding nodes in tree level Tj. The parent-child relationship 
among points in the hierarchy defines the same parent-child relationship among the corresponding tree 
nodes. We will refer to the distance between two tree nodes, by which we mean the distance between their 
corresponding points. Further, we compress all nodes whose points are only implicit in the hierarchy; this 
results in the contraction of some unary paths. This tree will allow us to navigate the hierarchy. 



Structural lemmas. Here we present lemmas that will be used to prove correctness of our oracles. The 
key observation motivating our constructions is captured by the following lemma, a variant of which was 
central for the construction of low stretch spanners [HJ [22| [£l [10| . While we state the lemmas in term of 
general 6, we are actually interested in the two cases where 6 = 6 and 6 = c = 

Theorem 1 Let x, y 6 Y 5 i be a pair that are not b-neighbors, and let x', y' £ Y^m in some level m > I be 
the lowest respective ancestors of x and y that are b-neighbors. Then d(x',y') is a (1 ± ^-approximation 
to d(x, y). 



Proof: The fact that x and y are not 6- neighbors implies that d(x,y) > 65', while the fact that x 1 and y' 
are 6-neighbors implies that d(x',y') < 65 m . The parent-child relationship implies that d(x,x'),d(y,y') < 
Y,?=i+i f • 5 m < f • 5 m . It follows that d(x, y) < d(x, x') + d(x', y') + d{y', y) < d{x', y') + §5 m < d{x', y') + 
Mx',y f ) = (1 + f b )d(x',y>). Also, d(x,y) > d(x',y>) - d(x,x') - d(y',y) < d{x> ,y>) - §5 m < d(x',y') - 
*- b d{x\y') = {l-l E )d(x>,yi). □ 

When 6 = 6 we have that d(x',y') is a (1 ± ^-approximation for d(x,y), and when 6 = c we have that 
d(x' , y') is a (1 ± ^-approximation for d(x, y). Hence, the problem of finding a (1 + ^-approximation for 
d(x, y) can be solved by finding the lowest ancestral c-neighbors of x and y in the hierarchy. Later, we will 
also make use of the following corollary. 



Corollary 2 A 5 -approximation for d(x' , y') implies a 5(1 ± ^-approximation for d(x, y), and vice versa. 

We have proved that finding the lowest ancestral c-neighbors of x and y in the hierarchy will provide a 
(1 + ^-approximation to d(x, y). The following lemma demonstrates a close relationship between the value 
of d(x,y), and the level in which the lowest ancestral 6-neighbors x',y' of x,y are found. 

Lemma 3 Let i be such that 5 i_1 < d(x,y) < 5*. Let x' and y' , be the respective ancestors of x and y in 
level Y$p . 

(i) If p < i — 1 — log 5 (6 + |), then x' and y' are not b-neighbors. 
(ii) If p> i — log 5 (6 — f ), x' and y' must be either b-neighbors or the same point. 

Proof: (i) We have that d(x' , y') > d(x, y) - d(x' , x) - d(y, y') > 5 i_1 - 2§ • 5 P = - f • 5 P . Note that 
for values p < i — 1 — log 5 (6 + |), we have that d(x', y') > (6 + |)5 P — | • 5 P = 6 • 5 P , and x" and y" cannot 
be 6-neighbors. 

(ii) We have that d(x', y') < d(x, y) + d(x', x) + d(y, y') < 5* + | • 5 P . Note that for values p > i — log 5 (6 — |), 
we have that d(x', y') < (6 — |)5 P + | • 5 P = b ■ 5 P , and x' and y' must be 6-neighbors or the same point. □ 
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Lemma [3] implies that the level of the lowest ancestral b- neighbors x',y' is in the range [i — 1 — log 5 (6 + 
|), i — log 5 (6— |)], and this range is of size O(l) irrespective of the value of b. Crucially, this means that the 
levels of the lowest ancestral 6-neighbors of x, y and the lowest ancestral c-neighbors of x, y differ by a fixed 
value (logc), up to an additive constant. This implies that finding the lowest common 6-neighbors of x,y 
is a useful tool to find their lowest common c-neighbors, and therefore a (1 + ^-approximation to d(x,y). 
A further consequence of Lemma [3] is that a 5-approximation for d(x,y) (or in fact for any descendants of 
x',y') is sufficient to pinpoint the level of the lowest ancestral 6-neighbors x',y' to a range of log 5 + 0(1) 
possible levels. 

Deletions. In closing this section, we note that for all dynamic structures presented in this paper, 
deletions are handled by rebuilding in the background (as was described in Section [2] in the context of 
dynamic hierarchies): Deleted points are kept in the structure, and when a large number of points have 
been deleted, we begin to rebuild the structure in the background. This has no effect on the asymptotic 
runtimes of our constructions. 

4 Oracle queries in O(l) time 

In this section we present (1 + e)-approximate distance oracles with 0(1) query time and size [e~°^ + 
20(Aiog A)j n _ rp^g £ rg j. orac j e we p resen t i s static, and the second is a dynamic version of the static con- 
struction. In Section 14.31 we briefly discuss variants of these constructions that appear in Table [T] 

4.1 Static oracle 

In this section we prove the following theorem: 

Theorem 4 There exists a static (1 + e) approximate distance oracles with O(l) query time and size 
[ e -0(A) + 2 0(Aio g A)] n _ The omc i e can be U pdated in expected time [2°W log 3 n + e~°^ + 2°( Alo s A )]n. 

Given points x and y, the oracle finds in O(l) time the lowest ancestral 6-neighbors x',y' of x,y. As a 
consequence of Lemma [3l the level of the lowest ancestral 6-neighbors of x,y gives the level of the lowest 
ancestral c-neighbors of x, y to within an additive constant. The level of the c-neighbors can then be found 
using a constant number of level ancestor queries. 

The oracle locates x' , y' in three steps, each of which can be implemented in O(l) time: In the first step we 
compute an O(logn) approximation to d(x,y). As a consequence of Lemma[3j this approximation restricts 
the candidate level of x', y' to a range of 0(log log n) possible levels. The second step then derives an 0(A 3 ) 
approximation to d(x,y), which further restricts the candidate level to O(logA) possible levels. The third 
step locates x',y'. 

Step 1. The first step provides an 0(log n)-approximation for d(x,y), which implies a O(logn) approxi- 
mation for d(x',y') (by Corollary [2|) . First note that for doubling metrics, there exists a 6-stretch spanner 
with 2°( x ^n edges that can be constructed in time 2°^nlogn [HI [10]. Given this spanner, we can con- 
struct the oracle of Mendel and Schwab \19\ Theorem 2(2)] with parameter k = O(logn), which yields 
an 0(log ?i)-approximate distance oracle of size O(n) that supports distance queries in 0(1) time, with 
expected construction time 2°^nlog 3 n. We construct the oracle in the preprocessing stage, and derive 
an 0(log n)-approximation for d(x,y) - and therefore for d(x',y') - in O(l) time. 



6 



Step 2. The second step gives an 0(A 3 )-approximation to d(x', y'), assuming an 0(log n)-approximation 
is already known. If A > log 1//3 n, this step is unnecessary and is skipped. We therefore assume that 
A < log 1/3 n. 

Recall that the O(logn) approximation to d(x',y') restricts the candidate levels of x',y' to a range of 
r = O(loglogn) levels (Lemma [3]). The ancestors of x and y in the top level of this range can be located 
via a level ancestor query on x and the desired number of levels below lca(x, y) (assuming that we have 
recorded for every node in T its distance from the root). But the task of locating the ancestors of x,y 
in the bottom level of this range is frustrated by the fact that some ancestors of x, y below x', y' may be 
compressed (if these nodes are below the lowest ancestral c-neighbors of x,y); these uncompressed nodes 
will be ignored by the level ancestor query, which will therefore return an incorrect level. To solve this 
problem, we preprocess a loglogn-jump tree for T (see Section l6.2p . A series of log log re-jump queries 
locate in 0(1) time explicit ancestors of x,y that are at most log log re levels below x',y', which will suffice 
for our purposes. Call these ancestors x",y" - By Corollary [51 an 0(A 3 )-approximation to d(x",y") yields 
an 0(A 3 )-approximation to d(x',y'). 

Now, for every node u S Tj.i og i ogn , i > 0, (including implicit nodes) let the neighbor set N u contain 
all explicit nodes that are descendants of u and it's 6- neighbors, in r + 2 log log??, levels below Tj. We 
preprocess the snowflake embedding for each non-empty neighborhood, which can be done in total time 
[2°( A ) logra + 2°( AlogA )]n (since each explicit node participates in 2°^ neighborhoods). Since A < log 1//3 n, 
the target dimension of the snowflake embedding is d = log 1 / 3 n log log n. Since the aspect ratio of each 
neighborhood is O(logn) and the embedding has distortion O(A), each coordinate can be stored in b = 
0(log A + loglogn) = O(loglogra) bits. Therefore b 2 d = o(logn). It follows from Lemma[9](see Section f6. 1 j) 
that each vector may be stored in O(l) words, and the embedding distance between two vectors returned 
in O(l) time. Squaring the embedding distance gives a 0(A 2 ) approximation to the true distance. 

It remains only to locate a neighborhood containing both x" and y" , for which it suffices to locate x"'s 
ancestor in the lowest level Tj.i og ] ogn above the candidate range. A pointer to this ancestor can be prepro- 
cessed in time O(loglogn) per node. Given the correct neighborhood, a 0(A 2 )-approximation for d(x",y") 
can be found in O(l) time, and this yields a 0(A 2 ) = 0(A 3 )-approximation for d(x',y'). 

Step 3. The third step locates x',y' in O(l) time, under the assumption that a 0(A 3 )-approximation to 
d(x' , y') is known. 

As in Step 2, the 0(A 3 ) approximation to d(x', y') restricts the candidate levels of x', y' to a range of 
r = 31og 5 A + O(l) levels. The top level of this range is found using a level ancestor query, and then 
a constant number of log 5 A-jump queries locate explicit ancestors of x, y that are at most log 5 A levels 
below x',y'. Call these ancestors x",y", and let their level (or the level of the lower one) be Tj. Note 
that d(x",y") < d(x",y') + d(y',x') + d(x',y") < 1^+^^+0(1) + 6 . 5 i+4iog 5 x+0(l) + 4 5 i+4iog 5 A+o(i) = 
^ 5 i+41og 5 A+0(l) = o(X A 5 i ). 

In the preprocessing stage, we find for each explicit node u G Tj all nodes in levels Tj_ r _i og A through Tj 
whose distance to u is 0(A 4 5 4 ). For each such pair, we preprocess their lowest ancestral 6-neighbors, which 
can all be done in time 2°( AlogA ) time per point. Now, given x" and y" , their lowest ancestral 6-neighbors 
can be located in 0(1) time. 

4.2 Dynamic oracle 

In this section we give a dynamic version of the oracle. We prove the following theorem: 
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Theorem 5 There exists a dynamic (1 + e)- approximate distance oracle with expected 0(1) and worst- 
case mm {2°( A ),0(log 2 logn)} query time, and size [e °( A ) + 2°( AlogA )]n. The oracle can be maintained 
dynamically in 2°^ logn + e~°^ + 2°( AlogA ) time per point update. 

The dynamic oracle is given points x and y as a query, and runs the two backup oracles of Section [5] 
in the background. Between them, these oracles locate the lowest ancestral c-neighbors of x, y in p = 
min{2°( A ) , 0(log 2 log n)} time (Theorems [S] and 

The oracle itself searches for the lowest ancestral 6-neighbors x', y' of x, y. After locating these nodes, a 
log c-jump query can be used to descend in the tree to within a constant number of levels of the lowest 
ancestral c-neighbors of x,y. As before, The oracle locates x',y' in three steps, each of which can be 
implemented in 0(1) time: In the first step we use the probabilistic dynamic tree embedding of Section r6.3.1l 
to compute a ^^-approximation to d(x,y) and therefore to d(x',y'). In the second step we use the 
probabilistic snowflake embedding of Section [6.3.21 to compute an 0(A 2 )-approximation to d(x' ', y'). In the 
third step we locate x',y'. 

Step 1. The first step provides a ^(^-approximation to d(x,y), which implies a p°^ approximation 
to d(x',y'). We utilize the dynamic tree embedding of Lemma [131 with parameter i = log 5 w 4 p. The 
probability that the embedding fails to give the desired distortion is given as 0(l/p). Since the 

backup oracles run in time 0(p), the event of failure does not affect the target expected runtime of 0(1). 

Step 2. The second step gives a 0(A 2 )-approximation to d(x',y'), assuming an ^(^-approximation is 
already known. If A > p 0<yl \ this step is skipped. We therefore assume that A < p°^\ 

Recall that the p°^ l > approximation to d(x',y') restricts the candidate levels of x',y' to a range of r = 
0(log») levels (Lemma [3|). The top level in this range is provided by an LCA query on the dynamic tree 
of Section 16.3.11 Then a series of logp-jump queries locate in O(l) time explicit ancestors of x,y that 
are at most logp levels below x',y', which will suffice for our purposes. Call these ancestors x",y" - an 
0(A 2 )-approximation to d(x",y") yields an 0(A 2 )-approximation to d(x,y). 

Similar to what was done before, we preprocess the dynamic snowflake embedding of Section 16.3.21 for 
each non-empty neighborhood. Our target dimension for the embedding is d = O(logp), so it follows from 
Theorem O that the embedding achieves an 0(A)-approximation with probability of failure only 0(l/p), 
which does not affect the expected 0(1) runtime of the oracle. Since the aspect ratio of each neighborhood 
is 0(p) and the embedding has distortion O(A) = 0(p), each coordinate can be stored in b = O(logp) 
bits. Therefore b 2 d = o(logn), and it follows from Lemma[9]that each vector may be stored in O(l) words, 
and the embedding distance between two vectors returned in 0(1) time. Squaring the embedding distance 
gives a 0(A 2 ) approximation to the true distance. 

We then locate a neighborhood containing both x" and y" , for which it suffices to locate x' n s ancestor in 
the lowest level Tj.i ogp above the candidate range. A pointer to this ancestor can be recorded dynamically 
in time O(logp) per node insertion into T. Given the correct neighborhood, a 0(A 2 )-approximation to 
d(x",y") can be found in 0(1) time, and this yields a 0(A 2 )-approximation to d(x',y'). 

Step 3. The third step provides a constant factor approximation to d(x,y) in 0(1) time, under the 
assumption that we are provided a 0(A 2 ) approximation to d(x,y). The 0(A 2 ) approximation to d(x',y') 
restricts the candidate levels of x' , y' to a range of r = 0(log A) levels. We can ascend to the bottom level 
of this range via pointers from x" and y" , and these pointers can be maintained dynamically in O(logp) 
time per insertion into T. The rest of the construction for this step is identical to the third step of the 
static oracle, and can be done in 2°( AlogA ) time and space per node insertion into T. 
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4.3 Variant constructions. 



Here, we briefly discuss three variant constructions that appear in Table [TJ We show that these construc- 
tions can find the lowest ancestral 6-neighbors of x, y, after which the lowest ancestral c-neighbors of x, y 
can be found easily by using a level ancestor query or a /c-jump query. 

Construction 4 is achieved by first running the 0(logra)-approximate oracle of [TS]. As mentioned in Sec- 
tion this approximation restrics the candidate levels of the lowest ancestral 6-neighbors to O(loglogn) 
levels. Using level ancestor queries, a binary search finds the correct level in O(logloglogn) query time. 

Construction 5 is achieved by running the static construction until the end of Step 2. At the end of Step 
2, the range of candidate levels in O(logA), and a binary search on this range finds the correct level in 
O(loglogA) query time. 

Construction 9 runs the dynamic construction until the end of Step 2, at which point the range of candidate 
levels is reduced to 0(min{log A, log log log n}). A binary search on these levels can be executed using at 
most log log A different /c-jump trees, resulting in query time 0(min{log log A, log log log log n}). 

5 Backup oracles 

In this section, we present two dynamic oracles that find the lowest ancestral c-neighbors of points x, y. 
The maintenance of both oracles is bounded by the time to maintain a hierarchy. The first oracle answers 
query in time 2°^\ and the second in time 0(log 2 logn). While we have presented these constructions as 
backup oracles, it should be noted that they are contributions of independent interest. 

5.1 Dynamic oracle queries in 2°^ time 

In this section, we give a dynamic oracle that given x and y, finds their lowest ancestral c-neighbors in the 
hierarchy, thereby deriving a (1 + e)-approximation to d(x,y). We prove the following theorem: 

Theorem 6 There exists a dynamic oracle that given x,y G S returns a (1 + e)- approximation to d(x,y) 
in 2°( A ) time, and supports updates in time 2°^ log n + e~°^ . 

An overview of the construction is as follows. Given hierarchy tree T, we create a forest of 2°^ distinct 
trees. The difference between these trees lies solely in their parent-child relationship. We then show that 
in at least one of these trees, x and y have their lowest common ancestor at level log 5 d(x,y), or within 
O(l) levels of this level. By Lemma[3]this level is within 0(1) levels of the lowest ancestral 6-neighbors of 
x,y. 

Construction. We create a forest of distinct trees T = {T 1 , . . . , T e } in a manner similar to the creation 
of T. Each tree is built on top of the point hierarchy, so all trees share the same nodes and tree level sets. 
However, we ignore every odd level of the hierarchy, so the trees of T only have non-odd levels. Each point 
in Y 5 j (for non-odd j) corresponds to a unique node in tree level j of each tree T h . 

It remains to describe the parent-child assignments for the trees of T . A node u G Tj is assigned a single 
parent v G which covers u. Crucially, ties among candidate parents are not broken arbitrarily (as 
they were for tree T). Rather, each tree T h G T possesses a distinct set of dominant nodes in each tree 
level. Given a group of candidate parent nodes, the dominant node in the group always takes the child. We 
stipulate that the distance between dominant nodes Tj 1 must be greater than 2 • & ' , so that two dominant 
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nodes cannot vie for the same child. Further, we stipulate that a node in Tj is dominant in exactly one 
tree of T. Clearly, a forest of size can obey these stipulations. 

The dominance assignment can be implemented as follows: When a point x is added to hierarchical level 
Y 5 j, a corresponding node u is added to Tj 1 for each T h G T. In one of these trees, u is chosen to be 
dominant. (Note that by the packing property of doubling spaces, there must be at least one tree in which 
u is not within distance 2 • 5 J of any other dominant node in the same level.) In each tree T h G 7~, u is 
assigned as a child of the dominant node in T^ +2 that covers u, or of an arbitrary node of rj+2 ^ there is 
no dominant one. Note that once a parent-child assignment is made, the assignment cannot be reversed. 
Hence, a newly inserted dominant node does not become the parent of previously inserted nodes that it 
covers. (A reassignment would necessitate a cut-link operation on the tree, which is not supported by 
either [6] or [7].) The entire forest T can be maintain in time 2°^ per node insertion into T. The distance 
between u G Tj 1 and its ancestor w G T^ n is less than 5 m YaLq 25 ~* = 1-1/25 = H ' 5 ™- 

Oracle query Let x and y be two points such that 5 4_1 < d(x,y) < 5\ We prove the following lemma: 

Lemma 7 (%) For all T h G T, ^ e LCA of x and y in T h is in tree level i — 2 or higher, 
(ii) There exists at least one T h G T for which the LCA of x and y in T h is in level i + 1 or lower. 

Proof: (i) Consider an arbitrary tree T h G T, and nodes x' and y', the respective ancestors of x and y in 
Tj 1 . We have that d(x,x'),d(y,y') < §§ • 5 j . We further have that d(x',y') > d(x,y) - d(x,x') - d(y,y') > 
5 i_1 - 2§§ • 5 j = - f| • 5 j . Note that for j < i - 3 - log 5 (25/12), (or equivalently, j < i - 4) we have 
that d(x',y') > 2 • 5 J+2 , and x' and y' cannot be siblings. Hence, the LCA of x and y cannot be found in 
level i — 3 or lower and can be found in level i — 2 or higher. 

(ii) Consider an arbitrary tree T h G T, and nodes x' and y', the respective ancestors of x and y in 
tree level Tj\ Assume that x' was inserted before y' . There must exist some covering point z G 
for which d(z,x),d(z,x') < §5 :,+2 . If there exist more than one point satisfying this condition, let z 
be the first inserted point satisfying the condition. Also recall that d(y,y') < || • 5 J . We have that 
d(z, y') < d{z, x) + d(x, y) + d{y, y') < f ■ 5^'+ 2 + 5* + § • 5* = j§ • 5^'+ 2 + 5*. Now let T z denote the tree 
in which z is dominant, and let nodes x' and y' be the respective ancestors of x and y in T z . Note that 
for values j > i — 2 — log 5 (19/120) (or equivalently, for values j > i), we have that d(z,y') < 5 J+2 , and so 
x' and y' are both children of z in T z (or are in fact the same point). This implies that x and y must be 
descendants of z. Hence, x and y must have a common ancestor in level i + 1 or below. □ 

The query proceeds by executing an LCA query for x and y in each tree of T ■ We select the lowest node 
among the ancestors returned from these LCA queries, say v G Tj\ By Lemma [3l this level is within a 
constant number of levels of the lowest ancestral 6-neighbors of x, y. Given v, the ancestors of x, y in Tj 
can be located in 2°^ time, and a logc-jump query then locates nodes within a constant number of levels 
of the lowest ancestral c-neighbors of x, y. 

5.2 Dynamic oracle queries in O (log 2 log n) time 

In this section, we give a dynamic oracle that given x and y, finds their lowest ancestral c-neighbors in the 
hierarchy, thereby deriving a (1 + e)-approximation to d(x,y). We prove the following theorem: 

Theorem 8 There exists a dynamic oracle that given x,y G S returns a (1 + e)- approximation to d(x,y) 
in 0(log 2 logn) time, and supports updates in time 2°^ logn + e~ olyX \ 
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We begin by presenting a solution for the static version of the problem and later show how to adapt this 
solution to the dynamic environment. We will make use of the point set S and tree T. 

Static construction. Recall that given x and y it is sufficient to find the lowest ancestral c-neighbors of 
x and y in order to answer the query. This problem could be solved by a simple traversal, in parallel, on 
the paths in T upwards from x and y. At each level we check whether a(x), the ancestor of x, and a(y), the 
ancestor of y, are c-neighbors, and the first encountered c-neighbors are the lowest ancestral c-neighbors. 
(Note though that some ancestors may not be explicit in certain levels.) This method may require 0(n) 
time. 

To improve this runtime, we note that the hereditary property, Property [2] implies that a binary search 
can be used. This binary search can be implemented via level ancestor queries on T (see Section [2]), and 
reduces the query time to O(logn). 

To further improve the query time we use a centroid path decomposition C of T. A centroid path decom- 
position partitions the tree T into a collection of centroid paths in the following way. The size of a node u 
(s(u)) is the number of nodes in the subtree rooted at u. Each centroid path has an associated power of 
2, say 2 i , and all nodes on the path have size 2 l < s(u) < 2' +1 . A node u is on the same centroid path as 
its parent if their sizes are both between 1 % and 2* +1 for some i. 

A well-known property of centroid path decompositions is that for any node u, the path from u to the tree 
root traverses at most logn centroid-paths (along their prefixes). To utilize this we create a centroid path 
tree that contains a node for each centroid path. The centroid path tree has an edge from centroid-path- 
node p to centroid-path-node p' if u, the head of the path p', is a child (in T) of a node on p. It follows 
from the path-decomposition property that the height of the centroid-path tree is O(logra). 

To speed up the queries we first perform a binary search along the path from x-to-root considering only 
the O(logn) heads of the centroid paths on the x-to-root path. This is done by using the centroid path 
tree and level ancestor queries on the centroid path tree. The nodes evaluated are compared with to 
counterparts (in the same level) in the y-to-root path in T, to see if they are c-neighbors. The node on 
the y-to-root path on the appropriate level can be found using a level ancestor query (in the tree T). This 
search determines which pair of centroid paths (one overlapping the path of x-to-root and one overlapping 
the path of y-to-root) contains the nodes that constitute the lowest ancestral c-neighbors. However, these 
paths themselves may be of size 0{n). Therefore, we preprocess the following information: We create a 
centroid path graph with the same node-set as the centroid path tree and an edge between two centroid- 
path-nodes if their paths contain any nodes that are c-neighbors. The edges are weighted with the lowest 
level on which there exist c-neighbors on these paths. Trivially, the centroid path graph is not larger than 
T, and can be preprocessed in the same time. Once this graph exists, the extraction of the lowest ancestral 
c-neighbors is immediate. 

Static query time. The time to binary search the centroid path tree is O(loglogn) as the height of any 
path (in the centroid path tree) is O(logra). Note that although we binary search on both paths, these 
searches are done one after the other and, therefore, the time is still O(loglogn). Once the two centroid 
paths that contain the lowest ancestral c-neighbors are found then we in O(l) we can obtain the lowest 
ancestral c-neighbors because of the preprocessing. 

Dynamic construction. Now consider the dynamic version of the query problem. A dynamic version 
of the above search encounters the following problems (1) level ancestor queries are not supported in this 
setting, and (2) the centroid paths, centroid path tree and graph must be maintained. 
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Recall that the level ancestor query was used twice, upon T and upon the centroid path tree. We will show 
how to remove the query on T and how to circumvent the level ancestor query upon the centroid path tree. 

First, we consider the problem of a dynamic centroid path decomposition. We will use the method from 
[SJ El [16] . The general idea of the method is a lazy approach achieved by changing the size constraints of 
the centroid paths to have nodes of size between 2 i and 3 • 2 i+1 . This gives the necessary time to (lazily) 
update the centroid path decomposition with worst case O(l) time per change. 

Consider the centroid path tree. Define a directed edge from a leaf to an ancestor to be an ancestor 
edge. We change the centroid path as follows. The node set, i.e. a node per centroid path, remains the 
same. However, the edge set is changed to be the collection of all ancestor edges. We note that it follows 
directly from the lazy approach method for the centroid paths that maintaining the ancestor edges under 
the dynamic changes is possible with the same lazy approach. Hence each update can be implemented in 
0(1) time. Unfortunately, the number of edges in the centroid path tree blows up to 0(nlogn) instead of 
the original n. However, this can be corrected by binarizing the tree T and using indirection on the tree 
in a method described in 0[16]. The idea follows along the following lines. 

The tree T is partitioned into a collection of trees CT of size O (log re) such that every node of T is in CT 
and an edge of T is in CT if it connects two nodes in the same tree in CT. The property of this partition 
is that each tree in CT has at most two other children trees of CT. A skeleton tree T containing the roots 
of the CT-trees as nodes and children-parent edges according to the CT tree relationship are created. See 
[T6l Section 6], for details of this skeleton tree and its dynamic handling. Obviously the size of the skeleton 
tree is Oinj log n). We will use a centroid path decomposition on the skeleton tree and create accordingly a 
centroid path tree. The centroid path tree can now handle the dynamic changes and searches and maintain 
a size of 0(n). 

A change needs to be made to the centroid path graph as well. Note that the centroid path graph, as 
opposed to the centroid path decomposition and centroid path tree, is unique to this problem. Beforehand, 
two centroid paths had an edge between them if there was a c-neighbor pair. We slightly change this 
definition such that two nodes (both in the skeleton tree) will be c-pseudo-neighbors if one of them is a 
c-neighbor of a node in the CT tree of the other. In the centroid path graph two centroid paths will be 
neighbors if there are a pair of nodes that are c-pseudo-neighbors. The weight of the edge, similar to before, 
will be the level of the lowest level for which we have a pair of c-pseudo-neighbors (the level is defined 
according to the node with the lower level). 

Finally, we need to replace the level ancestor query which we used upon T. This query was done when we 
had an ancestor of x which was the head of a centroid path p on some level, say j, and we needed to find 
its counterpart, i.e. the ancestor of y on level j, to see if they are c-neighbors. The replacement will be a 
binary search on the path from y to root in T along the heads of the centroid paths. This is done until 
we are in the position where we have two centroid paths p' and p" on the y-to-root path, where p" is the 
son of p' in the centroid path tree and where the level of the head of the path of p' is > j and the level of 
the head of p" is < j. It can be verified that the counterpart of the ancestor of x is in a CT tree whose 
root is on the centroid path p' and hence if the ancestor of x and it's counterpart are c-neighbors then 
the ancestor of x and the root of the CT tree (containing the counterpart) are c-pseudo-neighbors. Hence, 
there is an edge (p,p') in the centroid path graph. Conversely, if there is an edge (p,p') because the level 
of the head of p' is lower than the head of p it follows from the hereditary property that the mentioned 
ancestor of x and its counterpart must be c-neighbors. Finding the lowest ancestral c-neighbors is done by 
finding the lowest pair of nodes (which are CT tree roots that are c-pseudo-neighbors). Then one needs to 
extract the appropriate node from the CT tree of one which is on the level of the root of the other. This 
can be done with a simple scan in the CT-tree. 



12 



Dynamic query time. A binary search on the path of x can be done in O(loglogn) as in the static 
case. However, for each step in the binary search on the path of x, we must execute a binary search over 
the path of y, in order to locate the ancestor of y in the correct level. Now, there is the additional step 
of moving from c-pseudo-neighbors to c-neighbors in order to find the lowest ancestral c-neighbors may 
cost O(logn) time because of the size of the CT tree. However, if we recurse the above-described method 
partitioning each of the CT trees then we will have small-CT trees of size O(loglogn) and extracting the 
appropriate node will take only another O (log log n) steps. 

6 Technical contributions 

In this section we present technical constructions utilized by the distance oracles. 
6.1 Euclidean distance oracle 

The following lemma utilizes atomic word operations to find the exact distance between (sparse) Euclidean 
points in 0(1) time. 

Lemma 9 Let S be a dynamic set of d- dimensional vectors, where each coordinate is a b-bit number. If 
b ■ d 2 = O(logn), then there exists a vector representation of points in S that 

(i) Constructs each vector of O(l) words in 0(1) time. 

(ii) Allows the £2 distance between any point pair p,q G S to be computed in O(l) time. 

Proof: Let pt be the i-th coordinate of (i-dimensional point p £ S, and recall that the £2 distance between 
two points p,q G S is defined as \\p - q\\ = J2i=o(Pi ~ n) 2 = Ya=o Pi ~ Ya=o PiQi + Ya=o <£■ lt suffices 
to show that there exists a vector representation for all points p,q G S that occupies 0(1) words per point 
and allows the sum Yli=o PiQi *° be computed in 0(1) time. 

Assume without loss of generality that d is a power of 2, and for the sake of simplicity, assume that 
4b ■ d 2 < [log n] , so that all operations below can be done on a single word. We pad each vector with d 
additional coordinates (each of b bits all set to 0), resulting in 2c?-dimensional vectors. 

For each point p G S, we create two vectors u p and v v . Vector u p is constructed as follows. Every 
coordinate of p is stored at the rightmost position of a range of r = 2b bits, with coordinate pi stored in 
bits [ir, . . . ,{i + l)r — 1] for all < i < 2d (numbered from the right end as usual), with all unused bits 
set to 0. Vector v p is constructed as follows. Every coordinate in p is stored in the rightmost position in a 
range of r' = 26 • d bits, with coordinate pi stored in bits [ir 1 , . . . , {i + l)r' — 1], with all unused bits set to 
0. 

Now take points p,q G S, and compute in 0(1) time the product w = u p x v q . Note that for all i, piqt is 
found in r consecutive bits beginning at position i(r' + r) of w. Set to all bits of w that do not correspond 
to a product p%q%, that is all bits not in the range \i(r' + r), i(r' + r) + r] for all i. (This can be done using 
bitwise AND with a fixed number.) We are left with vector w that contains exactly one copy of each 
product Piqi. It remains only to sum these entries in O(l) time. 

To this end, let s be a vector that has a 1 in the i(r' + r)-th bit for every < i < 2d and elsewhere. Let 
y = w x x. The sum of the entries of w is found in 2b + logd bits beginning at position (r' + r)(d — 1) of 
y. □ 
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6.2 Dynamic jump tree 



In this section, we will describe a dynamic structure that supports jump queries. The compressed hierarchy 
tree T was described in Section [3j We now describe fe-jump queries on the tree T. 

Definition 10 A k-jump query on compressed hierarchy tree T provides two explicit tree nodes, u £ Y 5 i 
and its ancestor w £ Y^v. Let m be the largest value less than p which is a multiple ofk. The query requests 
the node v £ Y$™ that is ancestral to u; if v is implicit then its lowest explicit ancestor is requested instead. 

The existence of a dynamic structure supporting jump queries would allow us to descend T via jumps. 

Lemma 11 For fixed k, a structure that supports k-jump queries of hierarchy tree T can be maintained 
along with 0(k) work per insertion to T and 0(\T\) space. 

Before presenting a proof of Lemma [TT\ we first need a preliminary lemma that extends the dynamic LC A 
structure of Cole and Hariharan [7j. 

Lemma 12 For any tree T, there exists an LCA query structure that supports insertion of leaves and 
internal nodes to T in 0(1) time, and answers the following query in 0(1) time: given nodes u,v E T, 
return w = lca(u,v) £ T as well as the children u',v' £Tofw that are the respective ancestors of u and v. 

Proof: Given tree T, we create a new tree T\ as follows. Let r be the root of T and let vq, Vf be r's 
ordered children. The root of the tree T\ is r. r's left child is vq, and for all nodes 1 < i < f we have that 
Vi is the right child of We then recursively build the subtrees of each child node Uj. This tree can be 
maintained in 0(1) time for each update to T. Now consider nodes u,v £ T that have w = lc&(u, v) £ T, 
and consider the nodes u',v' £ T that are children of w and the respective ancestors of u,v £ T. Assume 
that u' precedes v' in the ordering of the children of w £ T. Then by construction, an LCA query on 
u, v £ Ti returns u' £ T\. 

It remains to identify v' . To this end, we create tree T2 as follows: The root of the tree T2 is r. r's left 
child is Vf, and for < i < f we have that Vi is the right child of Uj-i- We then recursively create the 
subtrees of r's children V{. Now consider nodes u,v,w,u' ,v' £ T mentioned above. By construction, an 
LCA query on u, v £ T2 returns v' £ T^. □ 

We can now proceed in the proof of Lemma [TTJ 

Proof: Let tree T' preserve every k-th level of T. We build T' from T as follows: Level Tj contains a copy 
of every uncompressed node of level Tj.k, j = 0, . . . ,00. Further, Tj contains a copy of every compressed 
node x in Tj.k that has its lowest uncompressed ancestor y in some level below Tj.^k+i), and x is given a 
pointer to y. This can easily be done in 0(k) time per tree update. (Note that the compression scheme 
implies that u is the only descendant of v in level Tj.k.) The ancestor-descendant relationship in T' is 
defined by the anscestor-descendant relationship in T. 

Now given a /c-jump query for nodes u,w £ T, we first locate the lowest respective ancestors u' , w' of u, w 
whose tree level is divisible by k. (This information can be maintained for each node in 0(k) time.) The 
LCA query of Lemma [T2l on u',v' £ T f , where v' is a child of w' £ T' which is not an ancestor of u' £ T', 
returns w' as well as the child u" £ T" of w' £ T'. u" £ T (or if it is compressed, its lowest uncompressed 
ancestor) is the desired node. □ 
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6.3 Dynamic embeddings 



Here we present two randomized dynamic embeddings for an ra-point metric space (S, d) with doubling 
dimension A. Both embeddings store 0{n) interpoint distances and each can be maintained in time 
2°( A ) minjlog n, log a} + per update (where I > 5 is a parameter specific to each embedding). 

• The first embedding is into a tree metric, with / = 0(A 2 ). Let T be the target space of the embedding. 
Given two points x,y £ S, we show that dx(x, y) > d(x, y) (that is, the embedding is non-contractive), 
and that dx{x,y) > [0(X)] l d(x, y) with probability at most (4/5A) 1 (for any positive integer i). 

• The second embedding is a snowflake embedding into £2, with I = 0(1). Let E be the target space 
of the embedding. Given two points x,y E 5, we show that < 1 (that is, the embedding 

is non-expansive to the snowflake), and that ^^~ji/2^ 2 > 2 _11 /A with high probability. 

Both embeddings are build upon the hierarchy of [6], after a new assignment of parent-child relationships 
to the hierarchical points. 

Parent-child assignment. We restrict ourselves to consider each |~log 5 Z]-th level in the hierarchy. (For 
ease of presentation, we will henceforth assume that I is a power of 5.) With regards to this restricted 
hierarchy, a repeated application of the covering property gives that every point in level H 5 n OS5 i = H t i 
is within distance §Z l+1 of some point in level flji+i, and this constitutes the covering property for the 
restricted hierarchy. 

Let x € Hn be a newly inserted point occurrence in the hierarchy. As in [T], we associate with x a radius 
r x £ [l l ,2l l ], where r x is a random variable sampled from a truncated exponential density function: The 
density function is f(r) = 1 _ A ^_a pe~ pr with parameter p = 21n(A 4 )/r when r G [Z l ,2Z*], and is /(r) = 
elsewhere. (This is the construction presented in pQ with parameter A = 4Z\) Then x is the parent of 
all subsequently inserted point occurrences in level -ff^-i within distance r x of x, unless those points are 
within the radius r y of a point y S that was inserted before x. This defines the parent-child relationship 
in the restricted hierarchy. 

The hierarchy stores 0(n) interpoint distances, and can be maintained in 2°^ min{logn,loga} + 2 ( AlogA ) 
update time. 

6.3.1 Tree embedding 

Here, we present a dynamic embedding of S into a tree metric. We use the hierarchy and parent-child 
relationships delineated above, with Z = 37A 2 . We extract a randomized tree from the hierarchy as follows: 
For each point occurrence in the restricted hierarchy, there exists a single corresponding node in the 
tree. Hence, the parent-child relationship among the restricted hierarchical points immediately defines a 
parent-child relationship in the corresponding tree, where an edge connect a parent to its child. From the 
randomized tree, we extract a tree metric <ir( - > •) by assigning a length to each edge: An edge rooted at 
level H d i is assigned length (4Z)\ We have the following lemma: 

Lemma 13 For any two points x,y £ S and positive integer i, where I = 37A 2 , 

• d T {x,y) > d(x,y). 

• Pr[^gf > f (4J)H-l] < (f x Y- 
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Proof: Consider any two points x,y £ S, or equivalently the corresponding point occurrences x,y E Hq. 
We first show that the tree embedding is non-contractive: If x and y have their least common ancestor 
in level i > of the tree, then by construction dr{x,y) > 2 • 4T > 8l\ while d(x,y) < 2^ =1 2Z J < 8l\ 
Hence, the embedding is non-contractive. 

Next, we derive a probabilistic upper bound on the expansion of the embedding: Let l k ^ 1 < d(x,y) < l k . 
Then the true distance between the hierarchical ancestors of x,y in level Him, m > k, of the restricted 
hierarchy is less than d(x, y) + 2| X}j=o ^ < ^ + ir^ m < ^-l m - By the covering property of the restricted 
hierarchy, x is covered by some point x' € fljm+i for which d(x,x') < |£ m+1 , and so a simple computation 
gives y's covering point y' € Hjm+i also falls within the radius r x i of x': d(x',y) < d(x',x) + d(x,y) < 
4^m+i _|_ 3£jm < /' m + 1 . Now, the probability that the respective ancestors of x and y in level Him do not 

share the same parent is bounded by 4A ^ +1 = ^ pQ. Hence, the probability that x and y have their 
lowest common ancestor at level Hp+i is bounded by (4/5A) 1 , in which case dx{x,y) < 2|^ J ^|(4Z)- J < 
f (Al) k+i < d(x,y) ■ f (4Z) m . □ 



6.3.2 Snowflake embedding 

In this section we give a dynamic Assouad style embedding [3], in which for a given metric space (S,d) 
we embed the snowflake (S, d 1 / 2 ) of the metric into I2 space. Our theorem can be viewed as a dynamic 
version of the theorem of \V2\ \T\. (A similar embedding holds for d 13 with < (3 < 1 and for general target 
space l p .) For simplicity we focus on the probabilistic version of the theorem which bounds the distortion 
with constant probability. 



Theorem 14 For any n "point metric space (S, d) with doubling dimension X, there exists a non-expansive 
probabilistic embedding f : S 1— )■ E, E C I® , that realizes the snowflake (S,d 1 / 2 ): For every pair x,y G S: 



Pr 

f:S^E 



\\(f(x)-f(y)h 

d(x, y) 1 / 2 



< 2- n /A 



< e 



-D/16 



Moreover, this construction can be computed dynamically with storage of 0(n) interpoint distances and 
2°( A ) minjlog a, log n} update time. 



Our embedding uses the same hierarchy and parent-child relationship presented above, with I = 8. Let 
.ff;i -cluster C x be composed of all descendants of x S H X i, and call x the center of this cluster. It follows 
that each point is found in O(loga) clusters, one cluster for each level of the hierarchy. Let C(P, y) denote 
the -f^i-cluster containing y. 

As usual for the construction of snowflake embeddings, we shall construct the embedding function / by 
defining for each integer 1 < t < D a function /w : X — > M. + , and then letting / = Z?^ 1 / 2 © 1<t<D f^'- 

Fix t, 1 < t < D, and in what follows we will define f^: For each restricted hierarchical level -ffy we 
define a function /f ) : S -> K+, and for each point x G 5, let = /^(a?)- Let {af\c x )\x E ^} 

be i.i.d. symmetric {0, l}-valued Bernoulli random variables. Let r = In2/(8A) > 2~ 4 /A. The embedding 
is defined as follows: for each x £ S, 

• For each i, let ff\x) = af\c{l\x)) ■ l' 1 / 2 mm{T~ l ■ gi{x),P}, 

where gi(x) is a function which computes the distance from x to the boundary of C(l l ,x). This can be 
computed as follows. Let v be the center of C(l l ,x) and let U be the set of -Hp-cluster centers within 
distance 4l l of v which were inserted into S before the insertion of v. For u in U let ri(u) denote its 
associated radius. Then: 
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• gi(x) = min{r.j(-u) - d(v,x),mm u€U (d(u,x) - r^u))}. 

The function gi(x) replaces the expression d(x,X \ Pi(x)) used in embedding of pQ. (Note that g%{x) is 
not affected by the insertion of new points into the hierarchy, and can be evaluated in time 2°( x \) The 
following properties are needed to show that it can be replaced in their analysis: 

Claim 15 For every x,y E X: 

• IfC{l\x) = C{l\y) then \g t (x) - 9i (y)\ < d(x,y). 

• If C(l\x) ^ C{l\y) thenmax{gi(x),gi(y)} < d(x,y). 

• 9i( x ) > P with constant probability. 

Proof: 

• To prove the first claim is clear from assume that gi(y) is minimized for some u E U, then: gi{x) — 
9i{y) < (d(u,x) — ri(u)) — (d(u,y) — ri(u)) < d(x,y). If gi(y) is minimized for v a similar argument 
applies. Similarly, g^y) - g^x) < d(x,y). 

• We prove the second claim by contrary assumption that d{x,y) < gi(x). It follows that d(x,y) < 
riiv) — d(v,x) which implies that d(v,y) < d(v,x) + d(x,y) < ri(v). Also for each u £ U, we have 
d(x,y) < d(u,x) — ri(u) which implies that ri(u) < d(u,x) — d(x,y) < d(u,y) but together these 
inequalities imply that y E C(l l ,x) which is a contradiction. 

• As a consequence of the analysis of [I], we have with constant probability that d(v,x) + p < ri(v) 
and also for every u E U, d(u,x) — p > ri(u). It follows that gi(x) > p with constant probability. 

□ 

Given Claim [J5] the analysis of pQ implies the following: 

Lemma 16 For any (x,y) E X and t E [D]: 

\fW( x )-fV(y)\<2 7 \.d(x,y) 1 / 2 . 

Lemma 17 For any (x,y) E X, with probability at least 1 — e D ^ 16 : 

\\f(x)-f(y)\\ p >2^.d(x,y) 1 / 2 . 

Proof: It follows from the Assouad-type argument that with probability at least 1/8: 

\f {t) (x)-fV(y)\ >2- 3 -d(x,y) 1 / 2 . 

The lemma follows from applying a Chernoff bound. □ 

The theorem follows from an appropriate scaling of the embedding so to achieve a contractive embedding 
with the required properties. 

Acknowledgements. We thank Richard Cole, Robi Krauthgamer, Manor Mendel and Michiel Smid for 
helpful conversations. 
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